Universality of optimal measurements. 
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We present optimal and minimal measurements on identi- 
cal copies of an unknown state of a qubit when the quality of 
measuring strategies is quantified with the gain of information 
(KuUback of probability distributions) . We also show that the 
maximal gain of information occurs, among isotropic priors, 
when the state is known to be pure. Universality of optimal 
measurements follows from our results: using the fidelity or 
the gain of information, two different figures of merits, leads 
to exactly the same conclusions. We finally investigate the op- 
timal capacity of A'^ copies of an unknown state as a quantum 
channel of information. 

PACS Nos. 03.67.-a, 03.65.Bz 

Consider an unknown state of a two-level quantum sys- 
tem described by the density matrix p{b), b being the 
Bloch vector, 6 = |6| < 1. The preparation device pro- 
vides N identical copies of the system, so that the state at 
our disposal is p{b)'^^ . In the past few years the optimal 
measuring strategy, i.e. the most successful at revealing 
the identity of the unknown state, has been obtained, first 
for pure states ||^,|^ and then for mixed states Also 
the minimal among the optimal strategies, i.e. the ones 
with the smallest number of outcomes, have been con- 
structed, both for pure states jj] and mixed states [||. 
In the processing of information contained in quantum 
states, knowing the most efficient read-out procedures, 
i.e. the optimal and least resource consuming ones, is of 
course of importance. 

In all these contributions the quality of the measuring 
strategy, characterized by a resolution of the identity 

^Af.-l, (1) 

i 

in terms of positive operators > 0, has been quanti- 
fied by the fidelity (5|. In other words, when outcome i 
(related to Mi) happens one guesses the unknown state 
to be Pi = pipi) and one quantifies the quality of the 
guess by 

F{pib),pm EE (tv[ ^p(by/2pip^)piby/2 ] ^ ' . (2) 

One can arrive at eq. (|^) from several different start- 
ing points. One of them is based on a measure of distin- 
guishability of the probability distributions associated to 



p and p' by performing general positive operator valued 
measurements (as in eq. (|l|)) on them Q and minimizing, 

F{p,p') = min ^TiipM,]^TT[p'M,]^ . (3) 

Another is based on the standard Hilbert space scalar 
product of the two pure states which belonging to C^®C^ 
lead to p and p' when reduced Q , 

^^(p,p') = max 1(^1^') (4) 

where maximization is performed over {lip) , [ip')} / p = 

TiaUMl p' = TraU')m- 

These equivalent definitions of the fidelity, plus the fol- 
lowing properties which characterize it further, make it 
a unique quantification of the comparison of two general 
quantum states: 

1.0<F{p,p')^F{p',p)<l. 

2. F(p,p') = l ^ P = P'; F{p,p') = Q ^ pp' = Q. 

3. F{Up U\ Up'W) = F(p, p'), UW = WU = 1. 

4. F{m\,p) = {^\p\^P). 

5. F{p ®a,p' ® a') = F{p, p')F{a, a') 

6. F{p,ppi + il-p)p2) > pF{p, pi) + {l-p)F{p, p2), 
< p < 1. 

In references the unknown state was known to 

be pure, 6=1, but no knowledge of the direction of 
the Bloch vector was assumed. In reference Q the un- 
known state was a mixed state drawn stochastically from 
a known isotropic distribution /(5), and although the 
best guess pi depended on /(5), the optimal measuring 
strategy, that is the set {Mi} of positive operators of the 
different outcomes, did not. For isotropic distributions 
optimal measurements are thus distribution, i.e. /(&), 
independent. 

However, proposing an outcome-dependent guess and 
evaluating its quality through the fidelity is only one of 
the criteria that could have been used to define optimal 
measurements. A sound alternative, the one we shall 
investigate in this work and probably the most sensi- 
ble choice in the context of quantum information theory, 
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consists of quantifying the quality of measuring strategies 
through the gain of information about the unknown state. 
In fact, information theory aheady supphes a universally 
accepted, unambiguous scheme for this purpose, that we 
shall follow. It is based on Bayes formula, which pro- 
vides a conditional (outcome-dependent), posterior dis- 
tribution fc(b\i) from the (here isotropic) prior distribu- 
tion f(b), and on the KuUback, which quantifies the gain 
of information acquired when replacing f(b) with fcib\i)- 
More specifically, if Pi{b) = Tr[p(6)Mj] is the probabil- 
ity of outcome i when the unknown state is p(b) and 

Pap{t) = I d^b f{b)P,{b) {J d^b f{b)^l) (5) 

is the a priori probability of outcome i, then Bayes for- 
mula states that the posterior distribution fcib\i), the 
one which collects our knowledge about the unknown 
state p{b) after measuring when the initial knowledge was 
given by /(6), reads 



mi) 



f{b)P^{b) 

Pap{i) 



(6) 



The gain of information about p{b), A/, is then given, in 
bits, by the KuUback of fc{b\i) relative to f{b) 



if, [/,//] ^ / d%fM^)^og^ 



fcm 

fib) ■ 



(7) 



This expression, the only one satisfying a series of intu- 
itively reasonable conditions ||ll| , is well-defined for con- 
tinuous distributions (it has no dependence on the mea- 
sure in the space of quantum states) and its average over 
possible outcomes. 



K[fJf] = Y,Pap{i)K^[fc/fl 



(8) 



is precisely the difference of the a priori and average a 
posteriori entropies H of the corresponding probability 
distributions of states, 



H[f]-H[f,] = -Jd% f{b)log^f{b) 

+ J2P-p(') I d'b fcimiog^fM 



(9) 



as can be checked by considering eqs. (1^^ and that 
^jPi(&) = 1 This quantification is therefore equiva- 
lent to the one already used in previous works on quan- 
tum state estimation with discrete distributions (see, e.g., 
ref. §). 

First, the question of which are the optimal measure- 
ments according to this information theoretically based 
criterion will be addressed. We will check explicitly for 
iV = 1 and N — 2, and provide clues for any N, that 



optimal -and also minimal- measuring strategies are uni- 
versal, i.e. independent of whether the fidelity or the in- 
crease of information is used for their quantification, and 
will compute the corresponding optimal gain of informa- 
tion A/. Then we will move to consider which is the 
isotropic prior /(6) for which optimal measurements ex- 
tract most information, so that it corresponds to the op- 
timal (isotropic) quantum channel of information. After 
introducing a reversible compression procedure we con- 
clude that the optimal amount of extractable information 
is, as iV ^ cxD, of one bit per effective qubit isotropic dis- 
tributions. 

In order to find an optimal measuring strategy, i.e. a 
set of operators Mi as in eq. (|^) maximizing the gain of 
information (eq. (H)), the following theorem and subse- 
quent corollaries, valid for any number of copies N, will 
be very useful. 

Theorem: Let the positive operator Mi > be such that 
its probability Pi{b) = Tr[Mi/9(6)®^] can be written, for 
any 6, as the sum of two contributions of the form 

P,,fc(6) =Tr[M,,fcp(&)®^], fc= 1,2, 

where the operators Mi^i,Mi^2 are also positive (and 
Mi^i + Mi^2 is not necessarily equal to Mi). Let us in- 
troduce corresponding prior probabilities Pap{i,k) and 
posterior distributions fc{b\i,k) as in eqs. (^) and (||). 
Then, 



Papi^)K,[fc/f] < ^/'ap(*,A;)if,,fc [/,//] 



(10) 



k=l 



Proof: It follows from the inequality 

{xi -|- X2) In — — - < In — -|- a;2 In — 

yi + 2/2 yi y2 

V xi,x2,yi,y2 > 0. □ 



(11) 



Corollary 1: An optimal measuring strategy with rank- 
one operators always exists, (cf. ]l2[) 

Proof: Indeed, suppose J2i ^'^i — ^ corresponds to an 
optimal measurement. Then, if Mj ~ |i, k){i, k\ is the 
spectral decomposition of Mi , it follows from the theorem 
that the rank-one POV measurement J^i k K ^X*' ^1 = 1 
is also optimal. □ 

We can already consider the case = 1, that is, when 
only one copy of the unknown state is available. One can 
convince oneself immediately that an optimal (and also 
minimal) measurement is just a standard von Neumann 
measurement. In fact, any will do because of the isotropy 
of f{b). Suppose that we measure a^. Then, for b = 
(6 sin 9 cos 0, 5 sin sin </>, b cos 9) , we have 



/e(5|±) = (l±6cos0)/(6) 



(12) 
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and the gain of information is 



TT / db h^f{h) 

^2 



l0g2(l-&) 



l0g2(l + &)- 

logae 



(13) 



The function in square brackets in eq. (13) is niono- 
tonically increasing, so that the distribution for which 
the absolute increase in knowledge is maximal is 



fi'Xb) = ^S{b-l) 



1 



(14) 



i.e. an isotropic distribution of pure states. 

It is interesting to point out that if instead of using in 
ref. Q the mean average fidelity F^^^ we had used the 
mean average increase in fidelity, 



J. J. , 



(15) 



with the optimal guess po = p(0) if no measurement is 
performed, so that 



with (cf. H) 



ap 



in 



1 



h/2 = 



db b^f{b)i- 



r 



(/q = 1,/q > 4/a-j,i), we would have obtained 
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(1 - 4/i)2 - /i/2 



(16) 



(17) 



(18) 



It is then easily verified that the maximum value of AF(i) 
also corresponds to the distribution eq. (p^). Thus, for 
N=l, quantifying with the fidelity or with the KuUback 
information leads to the same (for N=l somewhat obvi- 
ous) optimal and minimal measuring strategy and to the 
same distribution which maximizes A/(i) and AF(i). Is 
this also true for = 2? 

In order to answer this question we need to present 
a second corollary. Notice first that with the following 
notation (borrowed from |^) for the composite Hilbert 
space of N copies of the unknown state p{b), 



for the corresponding local spin operators, 



(19) 



Sa 
Sb 



1 ^ 
—a ( 
2 

1 

2^^ 



j(g>N-l 



and for the partial and total spin operators, 

a 

"^(a) = X/ "^Z^' 0! = ^jB, - ■ ■ ,N; S = S(^a=N), 
/3=A 

the following spin invariances hold a: 



a = A, 



(21) 



(22) 



q2 qI 



and since 

Va,/3 (23) 
the total Hilbert space can be written as a direct sum 

7^W=e(.,„,}i?{,,„,| (24) 

where E^^^^^^ are the simultaneous eigenspaces of all the 
operators S'^^y'ia ^ A, with corresponding eigenvalues 
{sa{sa + l)}j ordered with decreasing a (see for more 
details). For instance, for TV = 2 only S'?^^ (s(b)) is 



relevant, i.e. E, 
reads 



{-(.)} 



Esi^g^ , and the decomposition 



(25) 



where Ei is the triplet or symmetric (under exchange of 
copies) subspace, with total spin s = s^^) = 1, whereas 
Eq is the singlet or antisymmetric subspace, with total 
spin s = 0. Then, 

Corollary 2: There always exists an optimal measur- 
ing strategy consisting only of rank-one operators of the 
form |{s(a)})({s(Q)}|, where the not necessarily normal- 
ized vector |{s(q)}) is an eigenvector of all partial and 
total spin operators, i.e. 

S'c/s) = S(^)(5(/3) + l) V/?, (26) 

and thus it belongs to the subspace E^^g^^^^. 

Proof: Let Mi = 1 correspond to an optimal mea- 
surement with rank-one operators Mi = (where the 
\i) do not need to be orthogonal nor normalized) and let 
^{sa} = } be a projector onto the whole subspace 
Es^sa,}- Then it follows from eq. ( p^ ) that 



a, 



(27) 



so that if we replace 1 with Mi in the LHS of this 
equation, we obtain a new measurement 

E N,{sa})(i,K}| = l; =n{,„}|z). (28) 

(20) Now, since eq. (|2|) imphes that for each 
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Tr[p{b) 



E Tr[p(6) 



h {sa}){h {sa}\], (29) which after integration of (f> gives 



the theorem guarantees that the measurement of eq. (|2S 
is also optimal. □ 

[Notice that exactly the same conclusion was also 
achieved, for any N, when the fidelity was used as a 
criterion for optimality this being indicative of the 
universality we are considering here.] 

Thus, in order to find an optimal measuring strategy 
for N = 2 we can always choose the pure states on which 
the measurement projects to be symmetric or antisym- 
metric under the exchange of the two qubits. Let us next 
compute A/'^' for the optimal strategy of ref. that is 
corresponding to a resolution of the identity of the form 



t = w){<7\ + -j2{\h,){h,\f', 



(30) 







2(1 -/i) 
(36) 



This is a function of p that we want to maximize. Only 
k \og2{k + \/k'^ — depends on p. The part —P is max- 
imized for p = and p = 1. The other part too, as one 
can see easily neglecting the term P. Thus A/^^-* is max- 
imized when \ip) is a product state and the resolution of 
eq. ( ^0| ) is indeed optimal. 

As we did for = 1, it is interesting to recall, with the 
help of ref. p] , the average increase in fidelity for N ^ 2 



(/i - h 



^(1-4/r)^ 



/3 - II 



(37) 



where \a) is the (normalized) singlet state, a ■ n\n) — \n) 
{{n\n) = 1) and the four unitary vectors hi point to the 
four directions of the vertices of a regular tetrahedron. 
One readily obtains 



fc{b\(7) = — - — -^-^ ; Papi'y) = h 



Papier) 
16 Pap[n) 



(31) 



1 



Pap(ri) = -(l-/i) (32) 



so that 

A/(2) =71 [ db b^f{b) 
Jo 

(1-6)3 



(1 + 6)= 



log2(l + 6) 



log2(l-6) + (l-6^)log2(l-6^) 



(l-/i)(^^+log2^)-/ilog2/i-2. (33) 

Can we do better, i.e. is there another resolution of the 
identity which leads to a larger A/^^^^? Let us prove that 
there is none. Because of corollary 2, the whole question 
boils down to whether symmetric entangled states could 
do better than the symmetric product states \hi) \hi) used 
in eq. ( ^o| ) . Consider therefore a general symmetric state 
of Schmidt decomposition 



l^> = \^l+>l+> + V^|->|->, pe[0,l], (34) 

where the isotropy of f(b) has been taken into account in 
choosing the basis. One can readily obtain the average 
Kullback information corresponding to this state. 



A/i^) = i 



db b^fib) 



/ d(j) dn /ll0g2 YTJ-: 

Jo J -I 



One can now check that both A/'^^ and AF^^^ are again 
maximized for the distribution eq. (14). For A/^^^ this 
follows by observing that the part in square brackets of 
eq. (^ is an increasing function of b and that the other 
part, which depends on Ji, increases as /i goes towards 
zero. 

We have thus checked for = 1 and N = 2 that 
both the fidelity and the Kullback information lead to 
the same optimal measuring strategy and to the same, 
pure state, distribution which maximizes their increases. 
We conjecture, while not foreseeing any feature which 
could jeopardize extending the proof to A^ > 2, that the 
universality of optimal measurements holds for any num- 
ber TV of copies of the unknown state Corollary 2 
makes this conjecture very plausible. The precise optimal 
strategy is in fact determined to a great extend by the 
isotropy of the prior distribution, the symmetries of the 
state p{b)'^^ which allow to choose each positive opera- 
tor Mi to act only on one of the subspaces £'{s(a)}j ^^nd 
the fact that both the fidelity and the Kullback favour 
strategies with outcomes i whose normalized probabil- 
ity of occurrence Ti[p{b)'^ Mi]/Ti[Mi] spans the largest 
possible range as a function of the direction of b. 

Now, suppose we want to use the A^ qubits as a quan- 
tum channel of classical information. Alice prepares N 
copies of a given state p{b) (the classical information be- 
ing encoded in the vector b) and sends them to Bob, who 
will perform a collective measurement in order to recover 
as much information about b as possible. The previous 
results single out using, when restricted to isotropic prior 
distributions, only pure states (6 = 1) to encode classical 
information as the optimal method. We can then easily 
compute the optimal capacity of this isotropic quantum 
channel for any A^, to find that 



h=k + lcos2(t>, 1 = 2y^p{l-p)b^{l- fl^), 
k=l + b^fi^ + {2p - l)2bfj., (35) 



A/W =log2(A^ + l)- 



A^ 



(N + l) 



log2 e, 



(38) 
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which for large N gives — bits carried per qubit. No- 
tice that this is a purely quantum channel, no additional 
flow of classical information being required at any stage. 
Its poor capacity can be exponentially enhanced with- 
out spoiling this fact if we take into account that a pure 
state </)®^ belongs to the symmetric subspace 5^^^ of the 
whole Hilbert space Since the dimension of 5^^-* 

is N +1, which corresponds to the dimension of a Hilbert 
space H^*^^ of AI = log2(A^-l- 1) qubits, Alice can always 
compress, by means of a state- independent, unitary (and 
thus fully reversible) transformation, the state 0*^^ to 
fit in M qubits, that will then be transferred to Bob. In 
this case the capacity increases up to 1 — O( iog jy ) bits 
per qubit, which is asymptotically the classical one (as 
expectable, since for any two inequivalent states cf) and 
(j)'^^ and (j)"^^ become orthogonal as — > oo), and 
which is consistent with the Levitin-Holevo bound ||l^] 
for the classical capacity of a quantum channel. 

Summarizing, using the gain of information as a guide 
we have constructed optimal and minimal measurements 
on N = 1,2 identical copies and have shown that for 
isotropic distributions the maximal gain of information 
is achieved for pure states. Also universality of opti- 
mal measurements has been proven, since these measure- 
ments exactly coincide with those obtained in previous 
work, where the fidelity was taken as figure of merit. We 
conjecture that also for A^ > 3 the most informative mea- 
surements are the most faithful ones, and vice versa. 
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